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DERIVATION  OF  THE  GENERALIZED,  AVERAGE  EUCLIDEAN 
DISTANCE  FUNCTION  FOR  THE  PDI  MODEL 


INTRODUCTION 

Research  has  demonstrated  spatial  density  (or  crowding)  to  be  a  significant  stressor  in 
animal  and  human  populations  (Galle,  Grove,  and  McPherson,  1972;  Baum  and  Epstein,  1975). 
In  previous  papers,  the  author  formulated  and  tested  a  mathematical  model  and  methodology  for 
measuring  discrete  spatial  density  in  human  populations  (O'Brien  (1989, 1990a,  1990b)) .  The 
model,  called  the  population  density  index  (PDI)  model,  was  demonstrated  to  provide  a  more 
accurate  and  flexible  approach  for  discrete  spatial  density  measurement  than  the  conventional 
formulation.  The  traditional  approach  to  measuring  human  physical  density  involves  two 
parameters:  the  number  of  persons  (n)  and  the  geometric  area  (A)  in  which  the  persons  dwell. 

The  equation  D  =  n/A  serves  as  the  conceptual  and  computational  definition  for  “density,” 
“congestion,”  “population  density,”  or  “physical  crowding,”  each  term  used  interchangeably.  In 
contrast,  the  PDI  model  is  based  on  three  parameters:  n,  A,  and  inter-object  distance.  The 
derivation  of  the  PDI  model  metrics  is  patterned  on  the  “square-root  law”  of  average  distances 
used  in  the  physical  sciences.  The  capability  to  model  inter-object  distance  within  a  defined 
geometric  plane  is  a  significant  enhancement  to  discrete  spatial  density  measurement  In  O'Brien 
(1991a),  the  PDI  model  was  generalized  to  any  finite  number  of  density  points  (i.e.,  people). 

The  motivation  for  developing  the  PDI  formula  and  model  was  the  need  to  be  able  to 
measure  crowding  among  people  from  variable  spatial  configurations  such  as  in  a  typical 
dynamic  workplace  environment.  The  conventional  density  model  assumes  that  a  static 
description  is  adequate  without  taking  into  account  the  way  in  which  people  use  an  environment 
over  time. 

The  PDI  model  has  been  used  at  the  Naval  Undersea  Warfare  Center  (NUWC)  Division, 
Newport,  for  density  measurement  (O’Brien  and  Kanter,  1988;  Kanter  and  O’Brien,  1989a; 
1989b)  in  submarine  attack  center  concept  of  operations  experiments  (Wallin,  1987).  Practical 
applications  of  the  PDI  model  resulting  from  research  at  NUWC  have  been  documented  for  a 
variety  of  disciplines  in  several  U.S.  patent  applications  (O'Brien,  1991c,  1991d,  1991e,  1991f). 

The  purpose  of  this  report  is  to  provide  a  more  rigorous  derivation  of  the  PDI  model  than 
currently  exists.  The  basis  of  the  PDI  model  is  the  distance  function  in  Euclidean  space.  All  of 
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the  measures  in  die  model  are  related  to  distance.  Thus,  an  attempt  is  made  to  characterize  the 
PDI  distance  function  in  R2  (two-dimensional  Euclidean  space). 


DERIVATION  OF  THE  DISTANCE  FUNCTION 


GENERAL  CASE  LATTICES 


The  notation  and  structure  of  this  section  is  patterned  on  Money  (1962,  Chapter  8,  “The 
Definite  Integral”),  where  the  theory  of  area  and  concept  of  functional  uniform  continuity  are 
developed  in  detail.  Also,  the  ideas  of  inner  and  outer  areas  of  bounded  sets  and  the  idea  of  a 
planar  figure  developed  in  Money  are  germane  to  the  present  development 


In  the  X-Y  Euclidean  plane  (quadrant  I)  of  figure  1,  any  two  consecutive  abscissa 
(horizontal)  or  ordinate  (vertical)  points  (denoted  by  a  large  dot  •)  are  assumed  to  be  equidistant 
with  interpoint  spacing  parameter  5.  That  is,  the  directed  distances  of  the  collinear  point  pairs 
(Plp2)  =  [(xfc.  y/M*k+l.  y/B  and  (P3p4)  =  [(xm,  yj)(xm,  y^jJ  are 


plp2  =  |xk+i  *  xj  =  8 , 


p3p4  =  jyj+i  -  yj  |  =  8 , 


(1) 


where  Xk  is  a  representative  abscissa  and  yj  is  a  representative  ordiaate;  (xk,  yp  >  0.  Generally, 
xk,  yj  will  not  be  lattice  (integer)  points.  In  this  report  the  units  of  die  interpoint  distance 
parameter  5  for  human  populations  are  assumed  to  be  feet  (8  2  1). 


The  interior  rectangular  lattice  shown  in  figure  1  consists  of  n  (a  nonprime  number)  finite 
points  arranged  uniformly  with  R  row  (horizontal)  and  C  column  (vertical )  points  such  that 
n  =  RC  (n  £  2).  The  selection  of  an  RC  configuration  and  the  computation  of  5  are  explained  in 
appendix  A. 


2 


Figure  1.  General  Case  Lattice 

For  a  representative  region  3  bounded  by  the  nonnegative  curves  (see  figure  1) 
fA(x)  =  (j  +1)Y/C 
gA00  =  jY/C ,  gA(x)  <  fA(x) , 

IcX/RSxS  (k+l)X/R,  0  <;  k  £  R-1,0  £  j  £  C-l, 
the  area  A(3)  is  defined  as 

( k+l  )X/R 

A(3)  =  J  [fA(x)  -  gA(x)]dx , 

kX/R 


=  XY/RC  =  A/n , 


which  is  seen  to  be  a  rectangle.  For  human  populations  with  feet  as  the  linear  units,  die 
restriction  will  be  placed  on  the  value  of  A/n;  viz.,  A  £  n.  When  A  =  n,  the  RC  rectangular  or 
square  uniform  discrete  distribution  is  referred  to  as  a  "unit  lattice";  otherwise,  the  homogeneous 
distribution  of  points  is  called  a  “nonunit  lattice.”  The  distinction  will  be  understood  in  context 
Each  such  rectangle  will  be  obtained  by  dividing  die  total  study  area  n[A(3)]  into  n  =  RC 
partitioned  rectangles  each,  with  area  given  by  equation  (3). 

The  connected  density  points  in  each  of  the  horizontal  and  vertical  intervals  are  defined 
by  relations  (or  multiple-valued  discrete  constant  functions): 

f(R)=  X  =  (R- 1)8  +  p,  p >0, 

(4) 

f(C)=  Y  =  (C - 1) 8  +  q,  q >0. 


Equations  (4)  indicate  that  each  X  or  Y  interval  consists  of  two  components:  the  length  of  the 
density  points  segment  [(R  -  1)8  or  (C  - 1)8]  and  an  excess  factor  (p  or  q).  The  region  outside  the 
perimeter  of  the  uniform  point  arrangement  [equal  to  A-  (R-  1)(C  -  1)82]  is  required  to 
accommodate  environmental  objects  (furniture,  equipment,  displays,  etc.).  Each  of  the  CX 
intervals  and  RY  intervals  is  defined  by  the  constant  functions  in  equations  (4).  The  interval  X 
will  be  partitioned  into  R  subintervals,  each  subpartition  of  which  will  have  the  length  shown  in 
figure  1,  and  Y  will  be  similarly  divided  and  have  die  length  shown  in  figure  1. 


The  derivation  of  the  coordinate  system  for  the  general-case  lattice  will  allow  a  precise 
graph  to  be  drawn  of  any  uniform  rectangular  distribution  on  a  rectangular  Cartesian  X-Y 
coordinate  system  such  that  the  interior  RC  lattice  is  contained  within  the  XY  exterior  region. 
The  coordinates  of  the  density  points  derived  from  equation  (4)  will  be  generated  by 

xk  «p/2  +  (k- 1)8,  1  £  k  £  R, 

(5) 

yj  *  q/2  +  (j  -  1)8 ,  1  £  j  £  C. 

Then,  rite  coordinate  system  for  the  general  case  will  be  defined  as 

(*k»  yj)=[(*i»  yi).  (*2>  yiX  •••»  (*k»  yj)>  •••>  (*r*  yc)] 


:[|-2M!+84- §+*-i*  h-w) . 

(|+(R-1)S,|+(C-1)8)]. 
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The  coordinate  system  of  equation  (6)  applies  to  either  a  unit  or  nonunit  lattice  because  it  is 
derived  from  the  general  case.  An  example  of  die  use  of  equation  (6)  is  depicted  in  figure  2. 
The  coordinates  were  generated  from  the  following  assumptions:  n  =  6;R  =  3,  C  =  2  (from 
equation  (A-4)  in  appendix  A);  A  =  X  H  Y  =  16  x  6;  p/2  =  4,  q/2  =  1  (from  (4));  8  =  4  (from 
equation  (A-5)  in  appendix  A).  Then  the  coordinate  points  are  generated  by  x*  =  4  +  4(k  - 1); 
yj  s  l  +  4(j  - 1).  The  plot  points  are  obtained  from  all  k  x  j  combinations  (k  =  1, 2, 3;  j  =  1, 2). 

Y 
6 

5 

4 

3 

2 

1 

0 

Figure  2.  Example  of  General  Case  PDI  Graph 


SPECIAL  CASE  LATTICES 


Each  density  point  (•)  is  now  assumed  to  be  the  centroid  (center  of  mass)  bounded  by  its 
respective  planar  region  (see  figure  3).  Let  a  representative  region  be  called  S.  The  area  of  S 
can  be  determined  by  first  defining  the  nonnegative  curves  as  the  boundaries  of  S: 


fB<x)  =  (j  +  l)S, 

gB(x)  =  j8 ,  gB(x)  <  fg(x)  > 


(7) 


k8£x£<k+l)5,  0£k£R-l,  0£j'£C-l. 


The  special  case  of  equation  (7)  can  be  derived  from  equation  (2)  by  assuming  that 
8  =  p  =  q  =  X/R  =  Y/C  in  equation  (4)  of  the  general  case  (i.e.,  proportionate  commensurability 
between  the  dimensions  of  the  outer  and  inner  rectangular  areas). 
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The  area  of  S  is  then  found  by  integrating  between  the  curves  fB(x)  and  gB(x)  in  the  x 
interval,  and  applying  the  Fundamental  Theorem  of  Calculus: 

Ck+05 


A(S) 


■I 

kS 


[fe(x)  -  gs(x)]dx  =  8  , 


82  £  1 . 


(8) 


This  is  intuitively  the  area  of  a  square  figure.  The  figure  will  be  obtained  by  dividing  the 
total  area  n[A(S)  ]  into  n  =  RC  partitions  after  determining  which  lattice  configuration  will 
accommodate  best  the  n  points  into  a  rectangular  configuration  with  associated  interpoint  spacing 
parameter  8  (see  appendix  A).  Note  that  for  commensurate  (unit  or  nonumt)  lattices,  the 
interpoint  spacing  parameter  is  related  to  the  region  in  equation  (8). 

Y 


►  0+1)8 
18 

►  38 
-►  28 
-►  5 

Figure  3.  Special  Case  Lattice 
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Bers  (1969,  Vol.  II,  chapter  8,  section  8,  "Centroids  of  Plane  Regions  and  Curves”)  gives 
the  following  definitional  formulas  for  determining  the  coordinate  points  (x*,  yj)  of  the  centroid 
in  region  S: 


<*+1)8 

I  ■ 


x[fB(x)-gB(*)]dx 


(xk,yj):xk  =  — 


8(2k  +  1) 


(k+l)8 


I 


kS 


[fe(x)  -  gB(x)]dx 


(9) 


(k+l)8 

I  • 


1/2  [fe(x) 2  -  gB(x)2]dx 


Vv.  = 


(xk,yj):yj  = 


6(2j  + 1) 


(k+l)8  2 

[few  -gfi(x)]dx 


I 


kS 


Here,  (xj^  yj)  represents  the  rule  for  locating  all  and  every  density  point  (centroid)  in  the  entire 
XY  area,  given  concisely  as 


(*k.  yj) = [(f + (k  -  i)s).  (f + 0  -  i)s) 


= K*i>  y\l  (x2»  yiX  (x*  yi).  •.  (x*  yi)  * 

(xi>  y2X  •••*  (xk*  Y2l  •••»  (xk>  yj>  •••*  (xr»  yc)] 

=  f[s/2,  5/2],  [3S/2,  5/2],  [58/2,  8/2], [5(2R  -  1V2,  8/2] , 
[8/2, 38/2], ....  [8(2R  -  iy2, 38/2], .... 

[»2  +(k  - 1)8, 8'2  +0  - 1)5 .  [8[2R  -  iy2, 2(2C  - 1^2]) , 

1  £  k<£  R,  1  C. 
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The  coordinate  system  of  equation  (10)  applies  to  unit  lattices  and  commensurate  nonunit 
lattices.  Figure  4  is  an  example  of  equation  (10)  applied  to  a  3  x  2  unit  lattice  (8=1  from 
equation  (A-5)  in  appendix  A).  Note  that  X/R  =  Y/C  =  p  =  q  =  8=VXY/RC  =  1  because  all  unit 
lattices  are  commensurate.  The  graph  is  plotted  from  equation  (10)  by  xt  =  k  -  0.5; 
yj = j  -  0.5  (k  =  1,2,3;  j  =  1,2). 


25 
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0  1/2  3/2  5/2  X 


Figure  4.  Example  of  Special  Case  PDI  Graph  (Unit  Lattice) 

Figure  5  is  an  example  of  a  graph  for  a  nonunit  commensurate  lattice  with  n  =  15  points  within 
area  of  40  ft  x  24  ft.  Note  that  X/R  *  Y/C  =  p  =  q  =  8 = VXY/RC  =  8.  Plot  points  are  generated  from 
equation  (10):  x*  =  4  +  8(k  - 1);  yj  =  4  +  8(j  - 1). 
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Figure  5.  Example  of  Special  Case  PDI  Graph  ( Commensurate  Norumit  Lattice) 

GENERALIZED  DISTANCE  FUNCTION  IN  A  LATTICE 

Since  the  coordinate  points  in  the  PDI  lattice  can  now  be  specified  completely,  the  PDI 
“exact”  and  “approximate”  distance  formulas  can  be  derived  (O’Brien,  1990b,  1991b).  Here  is 
derived  the  generalized.  Euclidean  distance  formula  for  any  PDI  lattice  (nonunit  lattice  and 
thereby  the  unit  lattice  as  a  special  case)  and  any  nonuniform  distribution.  First  shown  is  the 
derivation  for  a  lattice  using  the  general  case  notation  system.  The  derivation  applies  equally  to 
the  special  case  by  assuming  commensurability. 

Let  any  density  point  in  the  X- Y  plane  be  called  (x^,  yj)  and  let  a  second  distinct  point  be 
called  (xk+i ,  y^  /).  Then,  from  equation  (6), 

(xk,  yp  =  (p/2  +  (k  - 1)5,  q/2  +  (j  - 1)6)  , 

(11) 

(xk+i’yj+/)  =  (p/2  +(k  +  i- 1)5,  q/2  +Q  +  /- 1)5)  , 

1  £  k  £  R - 1 ,  1  £  j  £  C-l  , 
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2  £  k  +  i  <;  R,  2  <;  j+/s  c. 

Bcrs  (1969,  Vol.  I)  shows  that  the  classical  Pythagorean  distance  formula  for  any  two 
points  in  a  Cartesian  plane  can  generally  be  derived  from  the  integral  calculus  arc  length  formula, 
given  for  our  notation  as 

£  =  f  Vl  +  [f’(x)]2dx.  (12) 

*k 


The  quantity  f  (x)  is  the  first  derivative  of  the  function  f(x),  taken  to  be  a  generalized  single¬ 
valued  relation  for  two  points  in  a  Euclidean  X-Y  plane  specified  by  the  first  degree  equation 
fix)  =  a  +  bx,  for  the  slope  intercept  a  and  linear  slope  b. 


Since  f '  (x)  =  =  Dx(a  +  bx)  =  b,  the  constant  slope  of  the  points  in  equation  (11) 

Ua 

can  be  defined  as  b  =  ~  ^J.  =  Then, 

xk+i "  xk  5j 


(13) 

=  bVi2  +  /2  , 

which  is  seen  to  be  of  the  form  for  the  standard  bivariate  Pythagorean  theorem  scaled  by  a 
constant: 


£  =  8 V  (xk+i  -  xk)2  +  (yj+/  -  yj)2  . 


(14) 


Bers  (1969,  Vol.  I,  p.  279)  terms  equation  (12)  the  “length  formula.”  It  may  also  be 
viewed  as  an  average  --  the  average  length  of  one  pair  of  points.  The  length  (distance)  between 
any  one  pair  of  points  in  the  uniform  RC  lattice  can  be  generalized  to  an  average  among  all 
possible  pairs  of  RC  points  since  each  point  pair  defines  a  simple  linear  function  each  of  which 
possesses  a  piecewise  continuous  first  derivative.  The  average  pair-to-pair  distance,  summed 
over  all  pairs  of  points,  will  be  the  average  of  all  the  line-to-line  curves  (total  length),  since  the 
connected  graph  defines  a  multiple-valued  relation  (Bers,  1969,  Vol.  I,  page  279).  That  is,  the 
uniform  average  distance  in  the  total  lattice  is 
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(15) 


X  Aj 

C(n,2) 


where 


C(n,2)  = 


(RC)t 


n(n  - 1) 


2!(RC  -  2)! 


(n2>2). 


(16) 


is  the  combinatorial  expression  specifying  the  total  number  of  nonredundant  pairwise-connected 
lines  from  n  nodes  and  the  exact  summation  index  limits  are  given  in  equation  (1 1).  The  uniform 
lattice  distance  equation  (15)  can  be  further  expressed  in  a  more  computational  convenient  form 
as 


d  =  5A. 


(17) 


where  8  is  given  in  appendix  A  and  A  is  the  unit  lattice  average  distance,  which  has  been 
derived  in  O’Brien  (1991a)  as 


D.i  r.i  _ 

12  X  I  (R - i) (C - j) Vi2  +  j2  +  RC(R2  +C?-2) 
i=l  j=l  _ 


3(RQ(RC  - 1) 


08) 


where  R  is  the  number  of  horizontal  points  in  each  row  of  the  unit  lattice,  C  is  the  number  of 
vertical  points  in  each  column  of  the  unit  lattice,  and  RC  is  the  total  number  of  density  points  in 
the  unit  lattice. 


An  accurate  approximation  to  equation  (18)  exists  when  n  is  not  small.  This  relation  is 
derived  under  the  assumption  that  there  is  a  continuous  uniform  distribution  within  a  rectangular 
plane.  The  objective  is  to  find  the  average  distance  between  any  two  randomly  selected  points  of 
a  convex  set  The  approximation  formula*  (Santalo,  1976,  formula  4.18,  page  49)  is  as  follows: 


(19) 


*  The  author  gratefully  acknowledges  an  anonymous  referee  of  The  American  Mathematical  Monthly  for 
suggesting  equation  (19)  (in  correspondence  related  to  O'Brien,  1990c). 
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where  d  =  V  R2  +  C2  and  In  is  the  natural  logarithm  operator. 

Calculations  have  shown  equation  (19)  to  be  a  good  approximation  to  equation  (18).  For 
example,  for  n  under  100,  the  maximum  discrepancy  is  less  than  10  percent  Equation  (19)  is  an 
interesting  example  where  a  continuous  distribution  relation  is  applied  to  a  discrete  distribution 
to  obtain  an  approximation  to  the  latter.  In  the  limiting  case,  as  RC  approaches  infinity,  the 
difference  between  equations  (18)  and  (19)  approaches  zero. 

In  conclusion,  for  any  finite,  discrete,  uniform  distribution  with  distance  between  any  two 
points  S,  the  generalized  average  Euclidean  distance  in  any  PDI  lattice  among  all  possible  pairs 
of  RC  points  will  bed  =  5A  or  8  A'  *>  d.  If  a  unit  lattice  $  =  1),  then  d  =  A  or  A'  *>  d.  Selected 
values  of  A  calculated  from  equation  (18)  are  given  in  appendix  B  for  all  RC  configurations 
from  RHC  =  2H2toRHC  =  100  H  100  (n  =  10,000  density  points). 


GENERALIZED  DISTANCE  FUNCTION  IN  A  NONLATTICE 


Here,  density  points  can  fall  anywhere  within  the  X-Y  geometric  area,  subject  to 
restrictions  specified  earlier.  The  average  Euclidean  distance  is  calculated  by  equation  (15)  from 
known  coordinate  points  as 


I  V  (Xj  -  xj)2  +  (yj  -  yj)2 

d  =£i_ - 

C(n,2) 


(20) 


where  (xi,  yO,  (xj,  yj)  (i  =  1, 2, ....  n;  j  =  1, 2, ....  n)  denote  the  coordinate  locations  for  the  density 
points  (n  >  1)  within  the  rectangular  study  area  X  K  Y  with  arbitrary  origin  O.  Equation  (20)  is 
the  form  used  for  calculating  the  population  density  measure  of  observed  density  points  (i.e., 
PDIjct,  defined  below  in  equation  (29)). 

An  approximation  to  equation  (20)  is  useful  because  exact  (xj,  yj),  (xj,  yj)  coordinates 
cannot  always  be  obtained.  Recently,  O’Brien  (1991b)  derived  an  approximation  PDI  method  by 
assuming  knowledge  of  the  relative  location  of  the  density  objects  when  (xi,  yi),  (xj,  yj)  data  were 
unavailable. 

If  one  assumes  that  the  study  area  A  =  X  K  Y  has  been  partitioned  into  n  =  RC  rectangles, 
each  with  subarea  given  by  equation  (3),  then  the  following  abbreviated  calculation  routines  can 
be  derived. 
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Define  a  cell  density  measure. 


Djk  =  njk/Ajk,  (2i) 

where  njk  is  the  number  of  objects  observed  to  be  within  each  of  the  subareas 

Ajk  =  A/n  (j  =  1, 2 . R;  k  =  1, 2, ....  C),  O^njk  £  Ajk,0£Djk£  1.  Then,  define  a  cell 

indicator  variable  I: 


Let 


_l  1  if  Djk*  0 , 
Ijk"|0  ifDjk  =  0. 


m  =  X  £ljk.  nD  £  m  £  n , 
k=l  j=l 


(22) 


(23) 


where  D  =  n/A  is  obtained  from  equation  (21)  as  an  average  cell  density  with  weights  spread 

R  C 

over  all  cells;  i.e.,  D  =  £  X  Djk  n-1 .  The  measure  m  represents  the  total  number  of  RC 

k=l  j=l 

partitions  occupied  by  at  least  one  object.  In  practice,  n^  is  taken  as  the  smallest  integer  value. 
Likewise,  m  is  taken  to  be  the  largest  integer  value. 

Hence,  equation  (17)  can  be  redefined  to  give  the  following  approximation  to  equation 

(20): 

d'  =  5'efrA,  (24) 

1  S5'eff  £D'1/Z  (25) 

8’eff  is  obtained  from  equation  (21)  as  an  average  cell  density  with  weights  spread  over  only  the 
m  occupied  cells.  The  limits  of  equation  (24)  follow  immediately  by  substituting  the  lower  and 
upper  limits  of  m  given  in  equation  (23);  viz.,  A  £  d’  £  A  YA/n .  Noting  that  1  £  8  £  YA/n  (see 
appendix  A)  and  assuming,  in  practice,  that  1  £  8'eff  £  8,  it  then  follows  that 

dmax  £d'  ^  dmin  ,  (26) 
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where  dmjn  and  dm»r  are,  respectively,  the  lower  and  upper  distance  measures  in  the  exact  PDI 
model  (O’Brien,  1990b).  The  relationship  of  (26)  translates  directly  into  a  proof  of  the  bounds  of 
the  approximate  PDI  measure  (PDI* act  =  VD/d'  =  [DVn/m  ]/A);  i.e.,  PDI* act)  is  bounded  by  the 
PDImin  and  PDImax  relations  defined  in  O’Brien  (1990b)  and  in  equations  (27)  and  (28)  below. 

ALTERNATIVE  DISTANCE  MODELS 

Thus  far,  the  distance  function  has  been  derived  for  a  rectangular  configuration  of  points 
by  assuming  a  rectangular  exterior  region.  Mathematically,  there  is  good  reason  for  doing  this 
because  a  square  or  rectangle  can  be  drawn  around  any  closed  curve  (Steinhaus,  1969). 

Occasionally,  the  environment  of  interest  may  be  modeled  by  curved  configurations  such 
as  ellipses  or  circles,  the  latter  being  the  easier  to  work  with.  Circular  distributions  have  two 
advantages.  First,  for  regions  nearly  square,  a  circle  offers  a  more  compact  concentration  of 
points,  which  may  provide  more  realistic  bounds  on  the  density  measure  for  highly  cluttered 
environments.  Second,  any  number  n  of  points  (including  prime  numbers)  can  be  placed 
uniformly  on  a  circle  of  radius  r  with  linear  point-to-point  distance  d  =  2r  sin(180/n).  Based  on 
this  chord  length  measure,  the  author  recently  constructed  a  PDI  model  for  discrete  spatial 
density  for  circular  distributions  (O'Brien,  1992). 
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SELECTED  PROOFS 


PROOF  THAT  PDImin  *  PDIact  *  PDImax 


First,  a  statement  of  the  relationships  involved  in  this  proof  is  given  as  follows: 

Lower  bound:  PDImin  =  » 

8A  v  A 

Upper  bound:  PDImax  =  ^y^, 

Actual  PDI:  PDI^t  =  =4-  . 

dact  V  A 


(27) 

(28) 
(29) 


The  terms  n.  A,  A,  and  dact  (equivalent  to  equation  (20))  are  used  here  as  defined  in  this  report;  8 
is  defined  in  appendix  A. 


Now,  to  the  proof.  From  the  relationships  of  equations  (27),  (28),  and  (29),  a  formal 
statement  of  the  relationship  to  be  proven  is  as  follows: 


oVa‘£Va 


(30) 


To  prove  that  equation  (30)  is  a  true  statement,  three  assumptions  are  required: 


dgct  >  8.  tuid  A  are  measured  in  linear  units  of  feet. 

(31) 

8  >  1, 

(32) 

dact  —  8A . 

(33) 

The  first  assumption  (31)  is  self-explanatory.  The  second  assumption  (32)  is  deemed 
reasonable  because  it  amounts  to  saying  that  if  persons  are  positioned  uniformly  the  head-to-head 
distance  (5)  is  about  1  foot  Although  (32)  would  not  be  a  reasonable  assumption  for  areal  units 
of,  say,  square  miles,  (32)  is  reasonable  when  the  areal  units  are  square  feet  (See  O’Brien, 

1991f,  for  the  finite  “macro”  PDI  model  when  areal  units  are  square  miles.) 

The  third  assumption  (33)  states  that,  for  a  given  geometric  area  to  be  studied  in  a  density 
analysis,  the  actual  clustering  of  the  density  points  (i.e.,  people)  in  that  area  (with  associated 
density  dact )  will  not  be  greater  than  the  maximum  theoretical  dispersion  provided  by  the 


15 


relation  5A  (equation  (17)).  The  region  outside  5A  is  assumed  to  contain  physical  objects  such  as 
furniture,  equipment,  etc.,  making  it  unlikely  that  density  points  will  be  observed  in  that  region. 
Empirical  evidence  from  Monte  Carlo  simulations  in  O'Brien  (1989)  is  cited  in  support  of  (33). 

In  effect,  (33)  assumes  that  the  persons  are  maximally  dispersed  in  accord  with  the  relation  8A  . 
Figure  6  describes  the  essential  meaning  of  (33). 


0  X 

figure  6.  Intuitive  Justification  for  Third  Assumption 


The  formal  proof  of  equation  (30)  can  now  be  given  in  detail.  The  proof  is  presented  in  three 
parts.  The  first  part  states 

ssVa^Va-  (34> 

Simplifying  and  rearranging  the  terms  of  equation  (34)  gives  the  following  relationship: 

dact  ^  SA ,  (35) 


which  follows  directly  from  (33). 


The  second  part  of  the  proof  states  that 


Simplifying  and  rearranging  the  terms  of  equation  (36)  gives  the  following  relationship: 

A  ^  dact .  (37) 


From  (33)  the  following  relationship  can  be  established: 

A 

from  which  it  can  be  deduced  that 


(38) 

(39) 


Since,  by  (32),  it  follows  that  1/5  ^  1,  then  it  can  be  deduced  that  A  /d^  $  1,  from  which  it 
follows  that  A  <■  dj^t. 


The  third  pan  of  the  proof  asserts  that 


(40) 


The  relationship  between  the  lower  and  upper  limits  of  equation  (40)  follows  necessarily  from 
the  proofs  given  for  equations  (34)  and  (36)  by  the  transitivity  property  of  relations.  It  can  be 
readily  seen  that  equation  (40)  reduces  algebraically  to  8  2:  1,  which  follows  directly  from  (32). 
Thus,  the  statement  of  equation  (30)  has  been  shown  to  be  true  as  derived  from  the  stated 
definitions  and  assumptions. 

The  proof  that  the  approximate  PDI  formula  is  bounded  by  the  minimum  and  maximum 
bounds  given  in  equations  (27)  and  (28)  follows  from  equation  (26)  and  from  the  definition  of 
the  approximate  PDI  measure. 
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PROOFS  FORSeff 


From  O’Brien  (1990b,  equation(9)),  8eff  is  defined  as 

8rff=^. 


(41) 


The  objective  is  to  show  that  8eff  £  1.  Since  ^1  £  1,  as  proven  from  equation  (36),  equation 

A 

(41)  follows. 

The  proof  that  5/Seff  £  1  is  as  follows: 

By  definition,  5eff  =  then,  8/8eff  =  J—  2:  1,  which  follows  because  it  reduces  to 
A  dact 

5A  2  dact ,  which  was  established  previously  in  (33). 

Because  the  quantity  5^  is  a  “pure  number”  (i.e.,  it  has  no  dimensions  because  they  cancel 
out  as  in  the  above  definition),  it  provides  a  pure  measure  of  relative  change  in  population  density. 

The  reader  may  also  note  that  in  the  approximation  model  8’eff  2  1  and  £  1 
follows  from  the  derived  limits  given  in  equation  (25)  and  the  relationship  given  in  equation  (26). 
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SUMMARY 


This  report  has  presented  derivations  of  various  distance  functions  that  relate  to  the 
author's  three-parameter  square-root  model  for  measuring  discrete  spatial  density  in  finite 
populations.  The  model,  called  the  Population  Density  Index  (PDI)  model,  was  developed  to 
capture  dynamic  density  relations  among  persons  within  a  naturalistic  environment  An  “exact” 
model  and  an  “approximate”  model  were  presented. 

The  derivations  related  a  generalized  Euclidean  distance  function  to  the  fundamental 
measures  in  the  model  (PDIact,  the  approximation  measure  PDI' &ct>  their  lower  and  upper 
bounds,  and  the  density  rate  indices  8eff  and  8'eff ).  Coordinate  systems  were  derived  for  plotting 

graphs  of  the  PDI  lattices  and  calculating  the  distance  measures. 

Also  derived  was  the  algorithm  required  to  select  a  conformal  lattice  and  the  average 
uniform  distance  among  the  lattice  points  based  on  the  number  of  density  points  to  be  analyzed 
within  the  reference  quadrilateral  area. 

Average  Euclidean  distance  values  (A)  were  presented  for  unit  lattices  up  to  a 
100  x  100  matrix.  Using  these  values,  researchers  will  be  able  to  compute  lower  and  upper 
bounds  of  the  PDI  measures  for  up  to  10,000  density  objects. 


19/20 

ReverseBlank 


APPENDIX  A 

SELECTING  A  UNIT  LATTICE  AND  INTERPOINT  DISTANCE  PARAMETER 


DERIVATION  OF  THE  ALGORITHM 

In  this  appendix,  the  algorithm  is  presented  for  (1)  determining  a  unique  finite,  discrete, 
conformal  RC  lattice  and  (2)  computing  the  average  interpoint  distance  among  the  RC  points. 

To  begin,  it  is  assumed  that  n  (sample  size)  and  A  =  X  K  Y  (the  outer  rectangular 
geometric  area)  are  known.  If  n  is  a  prime  number  (like  5  or  13  or  29),  augment  n  by  1  before 
determining  the  rectangular/square  dimensions  of  the  unit  lattice.  The  derivation  of  the 
algorithm  for  selecting  an  RC  lattice  is  developed  from  concepts  of  number  theory  (Ore,  1967). 

In  particular,  interest  is  centered  on  sets  and  subsets  of  composite  numbers  that  can  be  expressed 
as  rectangular  or  square  integers;  i.e.,  positive  (nonprime)  integers  drat  are  two-integer  products. 

The  value  of  n  can  be  expressed  in  terms  of  the  prime  factors  of  the  whole  number; 

n  =  IIpjaj’  (A_1) 

>=1 

where  Pj  represents  the  jth  prime  number  and  a,-  is  the  number  of  occurrences  of  the  jth  prime 
number  of  n.  For  example,  composite  60  can  be  decomposed  into  P1aiP2a2P3a3  *  22  x  3  x  5. 
Next,  it  is  desired  to  derive  the  total  number  of  possible  RC  (n  =  RxQ  product  configurations 
of  n  in  order  to  create  the  set  of  RC  configurations;  the  latter  will  be  a  subset  of  the  former.  This 
number  can  be  derived  as  follows. 

Let  x(n)  represent  the  number  of  all  possible  configurations  of  a  composite  integer  n. 
Then  it  can  be  shown  that  this  quantity  is  obtained  from  equation  (A-l)  by 
r 

X(n)  =n  (aj  +  1)  •  (A-2) 

j=l 

For  example,  60  can  be  partitioned  into  (2  +1)(1+1)2  =  12  two- integer  products. 

Next,  the  set  of  the  x(n)  configurations  is  examined  to  select  only  those  nontrivial  and/or 
nonredundant  configurations.  Let  d>(RC)  represent  the  total  number  of  nomedundant  and 
nontrivial  R  H  C  configurations  for  composite  n,  x(n)  3  0(RQ.  The  trivial  configurations  are 
those  for  which  n  =  nxlorlxn,  and  the  redundant  configurations  are  the  multiplicative, 
commutative  equivalents  of  R  K  C;  i.e.,  RHC  =  CHR  (R  £  C)  (e.g.,  10  x  4  =  4  x  10).  Then, 


A-l 


(A-3) 


0(RC)  = 


x(n)  -  2  +  S 
2 


where  S  =  0  when  n  is  a  rectangular  number,  and  S  =  1  when  n  is  a  square  number.*  The  set  of 
all  such  specified  configurations  is  denoted  P  of  size  <D(RC)  =  m;  P  =  {RjCj,  R2C2» ....  RiQ. .... 
RmCn,},  (Rj  ^  Q).  For  example,  if  n  =  60,  then  C>(RC)  =  [(3  x  2  x  2)  -  2  +  03/2  =  5; 

P  =  {30  x  2, 20  x  3, 15  x  4, 12  x  5, 10  x  6}.  Note  that  the  trivial  (60  x  1, 1  x  60)  and  redundant 
commutative  equivalent  configurations  (2  x  30, 3  x  20, 4  x  15, 5  x  12, 6  x  10)  have  been 
eliminated  from  P.  Likewise,  for  n  =100, 0(100)  =  0(22  x  52)  =  [(3  x  3)  -  2  +  1  )]/2  =  4; 

P  =  {50  x  2, 25  x  4, 20  x  5, 10  x  10}. 


Selection  of  a  unique  RC  lattice  with  interpoint  distance  parameter  8  is  accomplished  by 
the  following  guidelines. 


Select  the  R  K  C  lattice  configuration  (usually  one)  with  dimensions  most  commensurate 
with  the  exterior  X  K  Y  dimensions;  i.e.,  the  one  for  which  X/Y  -  R JC  is  a  minimum  absolute 
difference  (X  £Y,  R£C).  Determine  the  uniform  interpoint  spacing  parameter 
8  =  VA/n  =  YXY/RC  as  defined  in  O’Brien  (1990b,  equation  (3)).  Next,  test  for  conformity  of 
the  dimensions  of  the  selected  lattice  to  the  study  area  dimensions  by  the  quantities  (R  - 1)8  and 
(C  - 1)8.  If  either  of  the  R,C  dimensions  is  nonconformal  (i.e.,  (R  - 1)5 1 X  or  (C  - 1)8  >  Y), 
then  conform  the  lattice  dimensions  by  adjusting  8  by  the  relation  8  =  min[X/(R  - 1),  Y/(C  - 1)]  - 
0.1.  Finally,  in  the  rarest  of  instances,  when  commensurability  is  achieved  simultaneously  by 
more  than  one  lattice  configuration,  the  researcher  should  approximate  5  as  above  for  each 
configuration,  and  then  the  R  H  C  configuration  will  be  that  associated  with  the  maximum  8 
value.  If  plural  maxima  5  occur,  select  the  R  K  C  configuration  associated  with  the  smallest 
value  of  A ,  given  in  appendix  B. 


The  symbolic  specification  of  the  above  guidelines  can  be  stated  as  follows.  Because  the 
desired  discrete  R  K  C  lattice  must  be  unique,  the  selection  mechanism  requires  a  complex 


*  Equation  (A-3)  is  not  proven  nor  could  a  proof  be  found  in  the  mathematical  literature.  Its  correctness 
seems  intuitively  obvious.  For  example,  for  a  number  to  be  square,  it  is  necessary  and  sufficient  that  all  exponents  in 
the  prime  factorization  (equation  (A-l))  be  even  (Ore,  p.  42),  which  implies  that  x(n)  is  odd,  as  is  x(n)  -  2,  but  adding 
1  (S)  makes  <t>(RC)  even.  Finally,  dividing  by  2  eliminates  the  rectangular  duplicates  in  x(n)  +  S  -  2.  The  same  logic 
applies  to  rectangular  numbers,  thus  completing  the  proof  outline. 


A-2 


two-step  procedure.  First,  the  following  commensurability  relation  is  determined  from  the 
dimensions  of  A  and  each  element  of  the  set  P: 


RkCt  =  min 

max  (X,Y) 

max  (Ri  ,Q)1 

l£i£m 

min  (X,Y) 

min  (Ri  ,Q)J 

(1  <Sk£m). 


(A-4) 


Then,  based  on  equation  (A-4)  above  and  equation  (4)  in  the  main  body  of  the  text,  8  is 
determined  from  one  of  the  following  four  mutually  exclusive  and  exhaustive  conditions: 


-“-‘l 


max 

ls/sk 


r  min 
l25ksm 


[( 


JL 


Rlc-1'Ck 


-)-o.i]> 


min 

\2£/isk 


[A  (R/QJ 


if  k  =  1  and  p  >  0  and  q  >  0  (A-5) 

if  k  =  1  and  p  £  0  or  q  <  0  (A-6) 

if  k  >  1  and  /=  1  (A-7) 

if  k  >  1  and  />  1  (A-8) 


where  p,  q  are  defined  in  equation  (4).  In  (A-5)  through  (A-8),  8  £  1  by  definition.  Also,  it  may 

be  proven  that  8  £  VA/n  based  on  equation  (4)  where  it  can  be  deduced  that  (R  - 1)8  <  X, 

(C  -  1)8  <  Y,  and  for  commensurate  lattices  (8  £  YA/r),  6  =  X/R  =  Y/C.  This  relationship  places 
an  upper  bound  on  8  that  is  important  in  tl<c  proofs  and  derivations  of  the  text 


Figure  A-l  summarizes  the  algorithm  tor  the  RC  Lattice  selection  and  computation  of  8. 

In  summary,  if  k  =  1,  RkCk  is  the  lattice  selected  from  equation  (A-4)  and  8  is  selected  from 
equation  (A-5)  or  equation  (A-6).  If  k  >  1, 8  is  selected  from  equation  (A-7)  and  R  H  C  is 
selected  as  the  lattice  associated  with  the  maximum  8  in  equation  (A-7).  Finally,  if  (A-7) 
provides  a  plurality  of  8  values,  then  (A-8)  is  used,  which  selects  the  R^Cy  (2  £/<  k)  lattice 
associated  with  the  smallest  A  value.  Appendix  B  contains  the  required  A  values  computed  to 
five  decimal  places.  Note  that  for  a  unit  lattice,  or  commensurate  nonunit  lattice,  k  =  1  and 
equation  (A-5)  computes  the  correct  8.  Hansen  et  al.  (1953,  Vol.  I)  provides  an  interesting 
discussion  of  commensurate  nonunit  lattices  related  to  a  square-root  law  for  distances  in  the  field 
of  discrete  finite-population  sampling  theory  when  equation  (A-5)  applies. 


Thus,  equations  (A-4)  through  (A-8)  provide  a  unique,  conforming  lattice  with  associated 
interpoint  distance  parameter  8.  A  table  of  prime  numbers  and  factorizations  of  composite 
numbers  is  an  indispensable  tool  for  implementing  equation  (A-4).  See  Lehmer  (1941, 1961)  for 
extensive  tables  and  Abramowitz  and  Stegun  (1964)  for  abbreviated  t  *bles. 
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These  calculations  assure  that  the  lengths  of  the  R  and  C  line  segments  of  the  nonunit 
lattice,  (R  - 1)6  and  (C  - 1)5,  containing  human  density  points  do  not  exceed  the  dimensions  of 
the  study  area.  The  utility  of  adjusting  5  (when  so  required)  as  recommended  resides  in  plotting 
minimum/maximum  dispersions  of  the  RC  density  points  in  die  study  area  as  given  in  equations 
(6)  and  (10). 
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Figure  A-l.  Flowchart  for  Determining  RhC  Unit  Lattice  and  Interpoint  Distance  Parameter  5 
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NUMERICAL  EXAMPLES 


Three  artificial  examples  are  selected  to  demonstrate  the  procedures.  A  complete  setup  is 
provided.  The  flowchart  in  figure  A- 1  is  useful  in  tracing  the  decision  logic. 

In  the  first  example,  the  data  are  as  follows:  n  =  12,  A  =  25  x  25  ft^.  It  is  obvious  that 
n  =  12  provides  two  nontrivial,  nonredundant  choices  (0(12)  =  2);  viz.,  RiCi  =  6  x  2  or 
r2C2  =  4x3.  Because  X/Y  =  1,  R2C2  =  4x3  comes  closest  to  satisfying  equation  (A-4). 
Because  k  =  1,  first  compute  8  =  7.22  (from  equation  (A-5));  R,C  is  found  to  be  conformal  (each 
row/column  “fits”  inside  the  outside  25  fi?  area  in  accord  with  equation  (4)).  Thus,  R  =  4,  C  =  3, 
and  5  =  7.22. 

In  the  second  example,  n  =  64  and  A  =  50  x  5  ft^.  This  example  is  one  of  those  rare 
possibilities.  For n  =  64,  t(64)  =  7;  0(RC)  =  [7  +  1  -  2]/2  =  3,  and  P  =  {32  x  2, 16  x  4, 8  x  8). 
Applying  equation  (A-4)  shows  that  32  x  2  and  16  x  4  are  equally  commensurate  (k  >  1);  i.e., 

|10  - 16)  =  |10  -  4|.  Thus,  because  k  =  2  and  /is  undetermined,  apply  equation  (A-7),  giving  8  = 
max  (1.57, 1.51 }  =  1.57(/=  1).  The  configuration  associated  with  the  largest  8  value  is 
16  k  4.  Thus,  R  =  16,  C  =  4,  and  8  =  1.57  for  this  data  distribution. 

As  an  example  requiring  equation  (A-8)  for  determining  RhC  and  8,  consider  the  data: 
A  =  80  x  16  ft2,  n  =  32,  P  =  { 16  x  2, 8  x  4}.  Here,  applying  (A-4)  to  the  above  data  distributions 
produces  |5  -  8|  =  |5  -  2|  (i.e„  k  =  2),  and  (A-7)  produces  8  =  max  { (5.23, 5.23)}  (/>  1),  which  is 
clearly  ambiguous.  But  min  [A  (16  x  2),  A  (8  x  4)]  =  min  (5.59, 3.27)  =  A  (8  x  4).  Thus,  R  K  C 
=  8  H  4,  and  8  =  5.23. 

In  general,  the  reader  will  note  that  (A-7)  or  (A-8)  will  be  required  for  determining  8 
whenever  the  study  area  ratio  X/Y  is  equal  to  the  average  of  die  ratios  of  two  equally 
commensurate  lattices.  The  above  examples  bear  out  this  relationship. 
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APPENDIX  B 

UNIT  LATTICE  AVERAGE  EUCLIDEAN  DISTANCE  VALUES 
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